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RECIPROCAL SUBTRACTION DIFFERENTIAL DISPLAY 

5 This application is a continuation-in-part of U.S. Serial 
No. 09/197,889, filed November 23, 1998, which is a 
continuation-in-part of U.S. Serial Application No. 
09/185,115, filed November 3, 1998 which is a 
continuation-in-part of U.S. Serial Application No. 
10 09/032,684, filed February 27, 1998. The content of the 
above identified applications are hereby incorporated 
into this application by reference, 

Throughout this application, various references are 
15 referred to within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated 
by reference into this application to more fully describe 
the state of the art to which this invention pertains. 

20 Backgrou nd of the Invention 

Changes in gene expression are important determinants of 
normal cellular physiology, including cell cycle 
regulation, differentiation and development, and they 
directly contribute to abnormal cellular physiology, 

25 including developmental anomalies, aberrant programs of 
differentiation and cancer (1-4) . In these contexts, 
the identification, cloning and characterization of 
differentially expressed genes will provide relevant and 
important insights into the molecular determinants of 

30 processes such as growth, development, aging, 
differentiation and cancer. A number of procedures can 
be used to identify and clone differentially expressed 
genes. These include, subtractive hybridization (5-10), 
differential RNA display (DDRT-PCR) (3,4, 11,12), RNA 

35 fingerprinting by arbitrarily primed PGR (RAP-PCR) 
(13,14), representational difference analysis (RDA) (15), 
serial analysis of gene expression (SAGE) (16,17), 
electronic subtraction (18,19) and combinatorial gene 
matrix analyses (20) . 



Since first introduced by Liang and Pardee (11) , DDRT-PCR 
has gained wide popularity in analyzing and cloning 
differentially expressed genes. In DDRT-PCR, total RNAs 
or mRNAs from two or more cell types (or cells grown 
5 under different conditions, cells representing different 
stages of development, cells treated with agents 
modifying cellular physiology, etc.) are 
reverse-transcribed with two-base-pair anchored oligo dT 
primers, which divide mRNA populations into 12 cDNA 

10 subgroups. Then, each cDNA subgroup is amplified by PGR 
with one of 20 arbitrary 10-mer 5' primers and a 3' 
anchored primer and the PGR- amplified cDNA fragments are 
resolved in DNA sequencing gels. The combinations of 
primers are designed not only to yield a detectable size 

15 and number of bands, but also to display nearly the 
complete repertoire of mRNA species - 

DDRT-PGR is a powerful methodology in which a vast number 
of mRNA species (>20,000, if no redundancy occurs) can be 

20 analyzed with only a small quantity of RNA (about 5 
fig) (11) . DDRT-PGR is often the method of choice when the 
RNA source is limiting, such as tissue biopsies. A 
direct advantage of DDRT-PCR is the ability to identify 
and isolate both up- and down -regulated differentially 

25 expressed genes in the same reaction. Furthermore, the 
DDRT-PCR technique permits the display of multiple 
samples in the same gel, which is useful in defining 
specific diagnostic alterations in RNA species and for 
temporally analyzing gene expression changes. However, 

30 the DDRT-PCR technique is not problem free. Difficulties 
encountered when using standard DDRT-PCR include, a high 
incidence of false positives and redundant gene 
identification, poor reproducibility, biased gene display 
and lack of functional information about the cloned cDNA. 

35 Furthermore, poor separation can mask differentially 
expressed genes of low abundance under the intense 
signals generated by highly expressed genes. The 
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generation of false positives and redundancy can be 
highly problematic, resulting in an inordinate 
expenditure of resources to confirm appropriate 
differential expression and uniqueness of the isolated 
5 cDNAs, The cDNAs must be isolated from the gels in pure 
form (contamination of bands with multiple sequences 
complicates clone identification) , reamplif ied, placed in 
an appropriate cloning vector, analyzed for authentic 
differential expression and finally sequenced. These 
10 limitations of the standard DDRT-PCR approaches emphasize 
the need for improvements in this procedure to more 
efficiently and selectively identify differentially 
expressed genes , 

15 A number of modifications and improvements of the 
DDRT-PCR approach have been described (21-23) . Single 
anchor or degenerate two-base anchor oligo dT primers can 
be used to streamline the massive numbers of reverse 
transcription and PGR reactions required for validation 

20 of cDNAs as well as to reduce false positives (24,25). 
Reproducibility can be improved by lengthening the 
arbitrary 5 * primers to accommodate a convenient 
restriction site followed by two cycles of PGR with 
successive low- and high- stringency annealing 

25 temperatures (25,26). DDRT-PGR with inosine-containing 
5' arbitrary primers can also increase reproducibility of 
this approach (27) . However, since these modifications 
have only been analyzed using a subset of primers, 
further studies are necessary to validate these 

30 modifications of DDRT-PCR with additional primers and in 
several model systems. 

In addition to genomic DNA contamination, mispriming, PGR 
artifacts, the high incidence of false positives and 
35 redundancy is also ascribed to poor separation between 
bands and the complexity of the templates amplified (28) . 
Furthermore, poor separation can mask differentially 



expressed genes of low abundance under the intense 
signals generated by highly expressed genes. By 
enriching for unique cDNAs and removing common ones, it 
should in principle be possible to enrich for low 
abundant gene products and significantly decrease the 
complexity of amplified sequences. In addition, the 
sequence bias of DDRT-PCR should also be reduced by 
decreasing template complexity. These assumptions serve 
as the basis for the development of reciprocal 
subtraction differential RNA display (RSDD) . 

Subtractive hybridization, in which hybridization between 
tester and driver is followed by selective removal of 
common gene products, enriches for unique gene products 
in the tester cDNA population and reduces the abundance 
of common cDNAs (9) . A subtracted cDNA library can be 
analyzed to identify and clone differentially expressed 
genes by randomly picking colonies or by differential 
screening (29-31) . Although subtractive hybridization 
has been successfully used to clone a number of 
differentially e3q>ressed genes (5-7,10), this approach is 
both labor-intensive and does not result in isolation of 
the full spectrum of genes displaying altered expression 
(9,18). 

In principle, DDRT-PCR performed with subtracted RNA or 
cDNA samples represents a powerful strategry to clone up 
and down- regulated gene products. This approach should 
result in the enrichment of unique sequences and a 
reduction or elimination of common sequences. This 
scheme should also result in a consistent reduction in 
band complexity on a display gel, thereby permitting a 
clearer separation of cDNAs resulting in fewer false 
positive reactions. Additionally, it should be possible 
to use fewer primer sets for reverse transcription and 
PCR reactions to analyze the complete spectrum of 
differentially expressed genes. Of particular importance 
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for gene identification and isolation, rare gene products 
that are masked by strong common gene products should be 
displayed by using subtraction hybridization in 
combination with DDRT-PCR. In addition, the DDRT-PCR 
5 approach with subtractive libraries could also prove 
valuable for efficiently screening subtracted cDNA 
libraries for differentially expressed genes. However, 
even though subtraction hybridization plus DDRT-PCR 
appears attractive for the reasons indicated above, a 
10 previous attempt to use this approach has proven of only 
marginal success in consistently reducing the complexity 
of the signals generated, compared with the standard 
DDRT-PCR scheme (32) . 

15 We presently describe a reciprocal subtraction 
differential RNA display (RSDD) approach that efficiently 
and consistently reduces the complexity of DDRT-PCR and 
results in the identification and cloning of genes 
displaying anticipated differential expression. 



20 
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Summary of the Invention 

This invention provides a method for identifying 
differentially expressed nucleic acids between two 
samples, comprising: (a) selecting a first and second 
5 nucleic acid sample, wherein the nucleic acid samples 
contain a repertoire of nucleic acids; (b) performing 
reciprocal subtraction between the nucleic acid samples 
to produce two subtracted nucleic acid samples; (c) 
amplifying the two subtracted nucleic acid samples; and 
10 (d) comparing the two subtracted nucleic acid samples to 
identify differentially expressed nucleic acids. 

This invention also provides a method for identifying 
differentially expressed nucleic acids between two 

15 samples, comprising: (a) selecting a first and second 
nucleic acid sample, wherein the nucleic acid samples 
contain a repertoire of nucleic acids; (b) amplifying the 
two nucleic acid samples; (c) performing reciprocal 
subtraction between the amplified nucleic acid samples to 

20 produce two subtracted nucleic acid samples; and (d) 
comparing the two subtracted nucleic acid samples to 
identify differentially expressed nucleic acids. 

This invention further provides the above -described 
25 methods, wherein the first and second nucleic acid 
samples are obtained from cells in different 
developmental stages. 

This invention further provides the above -de scribed 
30 methods, wherein the first and second nucleic acid 
samples are obtained from cells from different tissue 
types. 

Also, this invention provides the above-described 
35 methods, wherein the 3' primer used in the PCR 
amplification is an oligo dT 3' primer. 
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In addition, this invention provides the above -described 
methods, wherein the 3' primer used in the PCR 
amplification is a single anchor oligo dT 3' primer. 

5 This invention also provides the above -described methods, 
wherein the comparing of step (e) comprises using a gel 
to separate the nucleic acids from both of the libraries. 

This invention provides the isolated nucleic acid 
10 identified by the the above -described methods, wherein 
the nucleic acid was not previously known to be 
differentially expressed between the two samples. 
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Brief Des cription of the Figures 

Identification of differentially expressed sequence tags 
5 using reciprocal subtraction differential RNA display 
(RSDD) . Left panel: differential RNA display pattern of 
conventional DDRT-PCR with RNA from Ell (C) and Ell-NMT 
(T) cells and an RSDD analysis of reciprocally subtracted 
Ell minus Ell-NMT (C/T) and Ell-NMT minus Ell (T/C) cDNA 
10 libraries. Right panel: representative RSDD patterns 
using different sets of primers. 

Fj-guy^ 2 

Reverse Northern analysis of differentially expressed 
15 sequence tags identified by reciprocal subtraction 
differential RNA display (RSDD) , Differentially expressed 
sequence tags obtained from RSDD were dot -blotted onto 
Nylon membranes and probed with 32P-cDNA reverse 
transcribed from RNA samples of Ell and Ell-NMT cells. 

20 

Figure 

Differential expression of representative progression 
elevated genes (PEGen) and progression suppressed genes 
(PSGen) identified by reciprocal subtraction differential 
25 RNA display (RSDD) and reverse Northern blotting. 
Northern blots of Ell and Ell-NMT RNA samples were probed 
with radiolabeled (^^P) expressed sequence tags identified 
by RSDD and reverse Northern blotting. 

30 Figure 3B 

Differential expression of representative progression 
elevated genes (PEGen) and progression suppressed genes 
(PSGen) identified by reciprocal sxibtraction differential 
RNA display (RSDD) aiid reverse Northern blotting. 

35 

Al ^1 ^Tn^^ liM^ 

Differential expression of representative progression 
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elevated genes (PEGen) and progression suppressed genes 
(PSGen) identified by reciprocal subtraction differential 
RNA display (RSDD) and reverse Northern blotting. 
Northern blots of cells displaying various stages of 
5 transformation progression were probed with radiolabeled 
(^^P) expressed sequence tags identified by RSDD and 
reverse Northern blotting. The cell types used include, 
TTnproareaeed Ell {-), CREFxEll-NMT Fl (-) and 
CREFxEll-NMT F2 (-) somatic cell hybrids, EllxEll-NMT A6 
10 (-) somatic cell hybrid, EllxEll-NMT 3b (-) somatic cell 
hybrid, and Ell-NMT Aza Bl (-) and Ell-NMT Aza CI (-) 
5-azacytidine treated Ell-NMT clones; and Progressed 
Ell-NMT ( + ) , CREFxEll-NMT Rl ( + ) and CREFxEll-NMT R2 ( + ) 
somatic cell hybrids, EllxEll-NMT A6TD (+) nude mouse 
15 tumor derived somatic cell hybrid, EllxEll-NMT Ila ( + ) , 
Ell-Ras R12 (+) a Ha-ras transformed Ell clone and 
Ell-HPV E6/E7 { + ) an Ell clone transformed by the E6 and 
E7 region of HPV-18. 

20 PiquT-e 5 

cDNA fragment of PEGen 7 - 90% Homology to Human HPV16 
ElBP. (Sequence ID No. 1) 

Figure 6 

25 cDNA fragment of PEGen 8 - Rat phosphof ructose kinase C. 
(Sequence ID No. 2) 

Figure 7 

First (Sequence ID No. 3) and second (Sequence ID No. 4) 
30 cDNA fragments of PEGen 13. 

Figure 8 

cDNA fragment of PEGen 14. (Sequence ID No. 5) 
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Figure 9 

cDNA fragment of PEGen 15. (Sequence ID No. 6) 
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cDNA fragment of PEGen 21 which has 94% homology to mouse 
PIN 14. (Sequence ID No. 7) 



Figure 11 

cDNA fragment of PEGen 24. (Sequence ID No. 8) 
Piyure 12 

CDNA fragment of PEGen 26 - Rat poly ADP-ribose 
polymerase. (Sequence ID No. 9) 

Figure 13 

cDNA fragment of PEGen 28. (Sequence ID No. 10) 

Figure 14 

cDNA fragment of PEGen 42. (Sequence ID No. 11) 
Figure 15 

cDNA fragment of PEGen 43. (Sequence ID No. 12) 



Figure Ig 

cDNA fragment of PEGen 44. (Sequence ID No. 13) 
Figure 17 

cDNA fragment of PEGen 48. (Sequence ID No. 14) 



CDNA fragment of PSGen 1 which has 80% homology to B. 
taurus supervillin. (Sequence ID No. 15) 

Figiire Ig 

cDNA fragment of PSGen 2 which has 91% homology to human 
HTLV-1 Tax interacting protein. (Sequence ID No. 16) 



Figure 20 

cDNA fragment of PSGen 4 
(Sequence ID No. 17) 



Rat proteasome activator. 
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cDNA fragment of PSGen 10 - Rat Ferritin Heavy Chain 
(Sequence ID No. 18) 

5 Figure 22 

cDNA fragment of PSGen 12. (Sequence ID No. 19) 

Figure 23 

cDNA fragment of PSGen 13. (Sequence ID No. 20) 

10 

Figure 24 

cDNA fragment of PSGen 23. (Sequence ID No. 21) 

Figure 25 

15 cDNA fragment of PSGen 24. (Sequence ID No. 22) 
Figure 2? 

cDNA fragment of PSGen 25. (Sequence ID No. 23) 

20 Fig^ye 27 

cDNA fragment of PSGen 26. 

Ficnire 28 

cDNA fragment of PSGen 27. 

25 

Figure 29 

cDNA fragment of PSGen 28. 

Figure 30 

30 cDNA fragment of PSGen 29. 

Figure 31 

cDNA fragment of PEGen 32. 
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Schematic outline of the reciprocal differential RNA 
display (RSDD) protocol. This scheme incorporates three 
steps, reciprocal subtraction of cDNA libraries, 
differential display of in vivo excised cDNAs and 
expression analysis by reverse Northern and standard 
Northern blotting. For the present application of RSDD, 
reciprocal subtraction hybridization was performed using 
libraries constructed from Ell and Ell-NMT cells, i.e.. 
Ell minus Ell-NMT and Ell-NMT minus Ell. Differentially 
expressed cDNAs identified on gels using differential RNA 
were isolated, reamplified and analyzed for expression by 
reverse Northern blotting. To confirm differential 
expression cDNAs were analyzed using Northern blotting. 

Figure 33 

Differential expression of representative progression 
elevated (PEGen) and progression suppressed genes (PSGen) 
identified by RSDD and reverse Northern blotting. 
Northern blots of Ell and Ell-NMT RNA samples were probed 
with radiolabeled (^^P) expressed sequence tags identified 
by RSDD and reverse Northern blotting. Equal loading of 
Ell and Ell-NMT RNA is demonstrated by ethidium bromide 
(EtBr) Staining . 

F^g^re 34 

Differential expression of representative PEGen and PSGen 
genes identified by RSDD and reverse Northern blotting in 
a large panel of rodent cells displaying differences in 
transformation progression. Northern blots of cells 
displaying various stages of transformation progression 
were probed with radiolabeled (^^P) expressed sequence 
tags identified by RSDD and reverse Northern blotting. 
The cell types used include: Unprogressed Ell (-) , CREF 
X Ell-NMT Fl (-) and CREF X Ell-NMT F2 (-) somatic cell 
hybrids, Ell X Ell-NMT A6 (-) somatic cell hybrid. Ell X 
Ell-NMT 3b (-) somatic cell hybrid, and Ell-NMT AZA Bl 
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(-) and Ell-NMT AZA CI (-) 5 -azacytidine- treated Ell-NMT 
clones; and Progressed Ell-NMT (+) , CREF X Ell-NMT Rl (+) 
and CREF X Ell-NMT R2 (+) somatic cell hybrids, Ell X 
Ell-NMT A6TD {+) nude mouse tumor derived somatic cell 
5 hybrid, Ell X Ell-NMT Ila ( + ) , Ell-Ras R12 ( + ) and 
Ell-HPV E6/E7 (+) an Ell clone transformed by the E6 and 
E7 region of HPV-18. Equal loading of RNAs is 
demonstrated by ethidium bromide (EtBr) staining, 

10 Pi mire 35 A 

PSGen 12 cDNA Seqeunce and PSGen 12 Protein Sequence 

Figure 35 B 

PSGen 13 cDNA Sequence and PSGen 13 Protein Sequence 

15 

Fig^ire 3^ C 

PEGen 28 cDNA Sequence and PEGen 28 Protein Sequence 

Fiigure 35P 

20 PEGen 32 cDNA Sequence and PEGen 32 Protein Sequence 
piqiire 35 E 

PEGen 42 cDNA Sequence and. PEGen 42 Protein Sequence 

25 pioiire 35 F 

PEGen 45 cDNA Sequence 

TT-jrfnre 35 G-1 and Figure 35 G-2 

PEGen 50 cDNA Sequence which are different parts of the 
30 gene. 

Figure 36 

PSGen 27 - Novel 
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P^t:aAle4 Pepcyiptjloff the Invention 

This invention provides a method for identifying 
differentially expressed nucleic acids between two 
samples, comprising: (a) selecting a first and second 
nucleic acid sample, wherein the nucleic acid samples 
contain a repertoire of nucleic acids; (b) performing 
reciprocal subtraction between the nucleic acid samples 
to produce two subtracted nucleic acid samples; (c) 
amplifying the two subtracted nucleic acid samples; and 
(d) comparing the two subtracted nucleic acid samples to 
identify differentially expressed nucleic acids . 

In an embodiment, the nucleic acid samples are mRNA or 
derived from mRNA. In another embodiment, the nucleic 
acid samples are total RNA. In another embodiment, the 
nucleic acid samples are cDNA- In another embodiment, 
the nucleic acid samples are a nucleic acid library. 

In an embodiment, differentially expressed nucleic acids 
are expressed at different levels. In a further 
embodiment, one of the nucleic acids is not expressed. 
In a different embodiment, one of the nucleic acids is 
expressed in truncated form. 

As used herein, reciprocal subtraction includes using 
nucleic acid sample A to subtract common nucleic acids 
from nucleic acid sample B (based on hybridization) and 
also using nucleic acid sample B to subtract common 
nucleic acids from nucleic sample A. In an embodiment, 
the complement of nucleic acid sample A is used to 
subtract nucleic acids from nucleic acid sample B and the 
complement of nucleic acid sample B is used to subtract 
nucleic acids from nucleic acid sample A. In a further 
embodiment, the RNA of nucleic acid sample A is used to 
subtract nucleic acids from nucleic acid sample B and the 
RNA of nucleic acid sample B is used to subtract nucleic 
acids from nucleic acid sample A. In yet another 
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embodiment, the cDNA of nucleic acid sample A is used to 
subtract nucleic acids from nucleic acid sample B and the 
cDNA of nucleic acid sample B is used to subtract nucleic 
acids from nucleic acid sample A. 

5 

As used herein, methods of amplification include PGR and 
rolling circle replication. 

A basic description of nucleic acid amplification is 
10 described in Mullis, U.S. Patent No. 4,683,202, which is 
incorporated herein by reference. The amplification 
reaction uses a template nucleic acid contained in a 
sample, two primer sequences and inducing agents. The 
extension product of one primer when hybridized to the 
15 second primer becomes a template for* the production of a 
complementary extension product and vice versa, and the 
process is repeated as often as is necessary to produce 
a detectable amount of the sequence . 

20 The inducing agent may be any compound or system which 
will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E.coli DNA 
polymerase I, thermostable Tag DNA polymerase, Klenow 

25 fragment of E.coli DNA polymerase I, T4 DNA polymerase, 
other available DNA polymerases, reverse transcriptase 
and other enzymes which will facilitate combination of 
the nucleotides in the proper manner to form 
amplification products. The oligonucleotide primers can 

30 be synthesized by automated instruments sold by a variety 
of manufacturers or can be commercially prepared based 
upon the nucleic acid sequence of this invention. 

This invention also provides a method for identifying 
35. differentially expressed nucleic acids between two 
samples, comprising: a) selecting a first and second 
nucleic acid sample; b) producing libraries for the first 
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and second nucleic acid sample; c) amplifying the two 
libraries; d) performing reciprocal subtraction between 
the amplified libraries to produce two subtracted 
libraries; and e) comparing the two subtracted libraries 
to identify differentially expressed nucleic acids. 

This invention also provides a method for identifying 
differentially expressed nucleic acids between two 
samples, comprising: (a) selecting a first and second 
nucleic acid sample, wherein the nucleic acid samples 
contain a repertoire of nucleic acids; (b) amplifying the 
two nucleic acid samples; (c) performing reciprocal 
subtraction between the amplified nucleic acid samples to 
produce two subtracted nucleic acid samples; and (d) 
comparing the two subtracted nucleic acid samples to 
identify differentially expressed nucleic acids. 

This invention also provides the above -described methods, 
wherein the two siabtracted nucleic acid samples from step 
c are amplified prior to the comparing of step d. 

This invention also provides the above -described methods, 
wherein the each of the nucleic acid samples comprises a 
library of nucleic acids. 

This invention also provides the above -described methods, 
wherein the nucleic acid samples are obtained from total 
cellular RNA purified by hybridization with oligo (dT) . 

This invention also provides the above -described methods, 
wherein the nucleic acid samples are obtained from total 
RNA from Ell and Ell-NMT cells. 

Ell is an adenovirus-transformed rat embryo cell line 
that acquires an aggressive oncogenic progression 
phenotype when injected into athymic nude mice and 
reisolated in cell culture (Ell-NMT) . 
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This invention further provides the above -described 
methods, wherein the first and second nucleic acid 
samples are obtained from cells in different 
developmental stages . 

5 

This invention further provides the above -described 
methods, wherein the first and second nucleic acid 
samples are obtained from cells from different tissue 
types . 

10 

This invention further provides the above -described 
methods, wherein the first and second nucleic acid 
samples are obtained from cells that differ in their 
exposure to external factors or in their gene expression. 

15 

In an embodiment, cells that differ in their exposure to 
external factors or in their gene e3q)ression includes any 
cells that may have different levels of gene expression, 
wherein some genes may not be expressed at all . In 

20 another embodiment, cells that differ in their exposure 
to external factors or in their gene expression includes 
any cells that are likely to have different levels of 
gene e3q>ression, wherein some genes may not be expressed 
at all. In still another embodiment, cells that differ 

25 in their exposure to external factors or in their gene 
expression includes any cell that has a phenotypically 
recognizable difference. 

A short list of examples of cells that differ in their 
30 exposure to external factors or in their gene expression 
includes: cancerous versus normal cells, advanced cancer 
progression cells versus ealier cancer stage cells, 
diseased cells versus nondiseased cells, infected cells 
versus noninfected cells, later developmental stage cells 
35 versus earlier developmental stage cells, cells after DNA 
damage versus cells before DNA damage, senescent cells 
versus younger cells, cells induced by growth factors 
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versus cells not induced by growth factors, cells in the 
process of neurodegeneration versus normal cells, and 
cells exposed to a chemotherapeutic agent versus normal 
cells . 

As used herein, different tissues types include but are 
not limited to tissues containing: cells grovm under or 
exposed to different conditions, cells in different 
stages of development, cells treated with agents 
modifying cellular physiology, and cells having different 
functions. 

In an embodiment, cells at different stages of 
development are cells taken or analyzed at times 
differing by one or more hours in the development of the 
cell or organism, 

Further, this invention provides the above -described 
methods, wherein the anplifying of step (d) comprises PGR 
amplification. 

Also, this invention provides the above -described 
methods, wherein the 3* primer used in the PGR 
amplification is an oligo dT 3' primer. A few examples 
of oligo dT primers are T13, T13A, and T13GA. 

In addition, this invention provides the above -described 
methods, wherein the 3* primer used in the PGR 
amplification is a single anchor oligo dT 3' primer. 
Olgio dT 3' primers include T13A, T13C, and T13G. 

This invention provides the above -described methods, 
wherein the PGR amplification uses a set of random 
primers . 

This invention provides the above -described methods, 
wherein the 5' primer is an arbitrary primer. 
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This invention also provides the above-described methods, 
wherein the comparing of step (e) comprises using a gel 
to separate the nucleic acids from both of the 
substracted libraries. 

5 

In an embodiment, the gel is a polyacrylamide gel. In 
another embodiment, the gel is an agarose gel. 

This invention further provides the above -described 
10 methods, further comprising PGR amplifying the first and 
second nucleic acid samples. 

This invention also provides the above-described methods, 
further comprising reamplifying differentially expressed 
15 bands . 

This invention also provides the above -described methods, 
further comprising reamplifying differentially expressed 
nucleic acid. 

20 

In one method of reamplifying differentially expressed 
bands, differentially amplified bands from plasmids of 
each subtracted library were marked with an 18G needle 
through the film and cut out with a razor. The cut out 
25 differentially expressed barids can be reamplified (i.e. 
by PGR) and examined by reverse Northern and Northern 
blot analyses. 

In addition, this invention provides the above -described 
30 methods, wherein the comparing of step (e) comprises 
comparing the band intensities of the two amplified 
differentially expressed nucleic acids. 
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In addition, this invention provides the above -described 
methods, wherein the nucleic acid samples are mRNA or 
cDNA derived from mRNA. 
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In addition, this invention provides the above -described 
methods, wherein the comparing of step (e) comprises 
comparing the quantities of the two amplified 
differentially expressed nucleic acids. 

5 

This invention further provides the above -described 
methods, wherein the differences in band intensity 
between the two subtracted libraries are electronically 
quantified. 

10 

This invention further provides the above -described 
methods, wherein the differences in the quantities of 
nucleic acid between the two subtracted libraries are 
electronically quantified - 

15 

In one embodiment, electronic quantification involves 
using a scanner to detect the bands. In a further 
embodiment, computer software, such as Corel Draw, can be 
used to determine the pixel intensity of the scanned 
20 image, thereby quantifying the band intensity. 

Also, this invention provides the above -described 
methods, wherein the libraries of step (b) are 
constructed with X-ZAP cDNA library kits. One skilled in 
25 the art would recognize that any cDNA library would be 
suitable . 

This invention provides the isolated nucleic acid 
identified by the the above-described methods, wherein 
30 the nucleic acid was not previously known. 

This invention also provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 12 (AI 144569) . 

35 

In addition, this invention provides the above-described 
isolated nucleic acid, wherein the isolated nucleic acid 
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is the nucleic acid designated PSGen 13 (Accession No. AI 

144570) • 

This invention provides the above -described isolated 
5 nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 23. 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
10 nucleic acid designated PSGen 24. 

This invention provides the above-described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 25. 

15 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 26 (Accession No. AI 

144571) . 

20 

This invention also provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 27 (Accession No. AI 

144572) . 

25 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 28 (AI 144573) . 

30 This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PSGen 29 (AI 144574) . 
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This invention provides the above-described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 13 (AI 144564) . 
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This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 14 (AI 144565) . 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 15 . 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 24 (Accession No. AI 
144566) . 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 28 (AI 144567) . 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 32 (AI 144568) . 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 42, 

This invention provides the above-described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 43. 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 44. 

This invention provides the above -described isolated 
nucleic acid, wherein the isolated nucleic acid is the 
nucleic acid designated PEGen 48. 
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This invention further provides a previously unknown 
isolated nucleic acid molecule identified by the above- 
described methods which comprises (a) one of the nucleic 
acid sequences as set forth in Figure 35; (b) a sequence 
5 being degenerated to a sequence of (a) as a result of the 
genetic code; (c) a sequence encoding one of the amino 
acid sequences as set forth in Figure 35. (d) a sequence 
of at least 12 nucleotides capable of specifically 
hybridizing to the sequence of (a) , (b) or (c) . 

10 

Finally, this invention provides a purified polypeptide 
comprising one of the amino acid sequence as set forth in 
Figure 35- 

15 The sequences of the cDNA of PSGen 12, PSGen 13, PSGen 
26, PSGen 27, PSGen 28, PSGen 29, PEGen 13, PEGen 14, 
PEGen 24, PEGen 28, and PEGen 32 were submitted to 
GenBank and assigned with accession numbers AI 144569, AI 
144570, AI 144571, AI 144572, AI 144573, AI 144574, AI 

20 144564, AI 144565, AI 144566, AI 144567 and AI 144568 
respectively. 

This invention will be better understood from the 
Experimental Details which follow. However, one skilled 
25 in the art will readily appreciate that the specific 
methods and results discussed are merely illustrative of 
the invention as described more fully in the claims which 
follow thereafter. 

30 Experimental Details 

We presently describe a reciprocal . subtraction 
differential RNA display (RSDD) approach that efficiently 
and consistently reduces the complexity of DDRT-PCR and 
results in the identification and cloning of genes 

35 displaying anticipated differential expression. Proof of 
principle for the RSDD approach has come from its 
application for the identification of genes 
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differentially expressed during cancer progression. RSDD 
has resulted in the identification and cloning of genes 
displaying elevated expression in progressed tumor cells 
(PEGen) and reduced expression in progressed tumor cells 
(PSGen)\ The model used for RSDD was an 

adenovirus -transformed rat embryo cell line, Ell, that 
acquires an aggressive oncogenic progression phenotype 
when injected into athymic nude mice and reisolated in 
cell culture (Ell-NMT) (10,33,34). Injection of Ell 
cells into nude mice results in tumors in 100% of animals 
with a tumor latency time of approximately 35 to 40 days, 
whereas Ell-NMT cells form tumors in 100% of nude mice 
with a tumor latency time of 15 to. 20 days (10,34,35). 
Additionally, Ell cells form colonies in agar with an 
efficiency of "3%, whereas Ell-NMT display an agar 
cloning efficiency of >30% (10,33,34). The increased 
tumorigenicity and enhanced anchorage independence 
phenotypes are key indicators of tumor progression in the 
Ell/Ell-NMT model system (10,33,34). 

Differential RNA display was directly performed with 
reciprocally subtracted cDNA plasmid libraries (Ell minus 
Ell-NMT and Ell-NMT minus Ell) • Compared with the 
subtraction of PCR-amplif ied cDNA in Hakvoort et al., the 
subtracted cDNA libraries used in this experiment are 
free from potential PGR artifacts and provide more stable 
and consistent sources for DDRT-PCR analyzes. In 
addition, three single anchored oligo dT 3 ' primers were 
used instead of two -base -anchored approach described by 
Hakvoort et al (32) . To further streamline the DDRT-PCR 
procedure, reamplified cDNAs identified using RSDD were 
analyzed using the reverse Northern blotting procedure 
(35,36). cDNAs displaying differential expression by 
reverse Northern blotting were subsequently confirmed for 
true differential expression by Northern analysis. These 
modifications incorporated in the RSDD strategy result in 
an efficient approach for using subtractive. hybridization 
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and DDRT-PCR for identifying differentially expressed 
genes , 



5 Total RNA from Ell and Ell-NMT cells was isolated by the 
guanidinium isothiocyanate/CsCl centrif ugation procedure 
and poly A* RNA was purified with oligo(dT) cellulose 
chromatography (5) . Two A-ZAP cDNA libraries from Ell and 
Ell-NMT mRNA's were constructed with A-ZAP cDNA library 

10 Kits (Stratagene) following the manufacturer's protocol. 
Reciprocal subtraction between Ell and Ell-NMT libraries 
was performed and two subtracted cDNA libraries (Ell 
minus Ell-NMT and Ell-NMT minus Ell) were constructed as 
described previously. Bacterial plasmid libraries from 

15 the subtracted A- ZAP cDNA libraries were obtained by in 
vivo excision following the manufacturer's protocol 
(Stratagene) and the plasmids were isolated with Qiagen 
columns (Qiagen Inc.). 

20 The purified plasmids of reciprocally subtracted cDNA 
libraries were directly subjected to differential display 
as in Liang et . al . (38) with minor modifications. The 
plasmids of reciprocally subtracted cDNA libraries were 
PCR-amplif ied with the combination of three single-anchor 

25 3» primers (T 13 A, T 13 C or T 13 G) and 18 arbitrary 5' 
10-mer primers obtained from Operon Technology Inc. 
(Alameda, CA. OPA 1-20 except OPAl and 3). The 20 /il PGR 
reaction consisted of 10 mM Tris-HCl pH 8.4, 50 mM KCl, 
1.5 mM MgClz, 2 ^M each dNTP, 0,2 /zM 5' arbitrary primer, 

30 1 /iM 3' anchor primer, 50 ng of plasmid of a subtracted 
library, 10 /xCi a-^^S-dATP (3000 Ci/mmole from Amersham) 
and 1 U of Taq DNA polymerase (Gibco BRL) , The 
parameters of PGR were 30 sec at 95 C, 40 cycle of 30 sec 
at 95 C, 2 min. at 40 C and 30 sec at 72 C and additional 

35 5 min. at 72 C. After the cycling, 10 ^1 of 95% 
formamide, 0.05% bromophenol blue and 0.05% xylene cyanol 
were added to each PGR reaction. The mixture was heated 
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at 95 for 2 min and separated in a 5% denaturing DNA 
sequencing gel maintained at 50 ^C. PGR reactions of 
plasmids from each subtracted library in a primer set 
were run side by side. Differentially amplified bands 
from plasmids of each subtracted library were marked with 
an 18G needle through the film and cut put with a razor. 
The gel slice was put in 100 fxl TE pH 8.0 and incubated 
at 4 overnight. After the incubation, the mixture was 
boiled for 5 min and microcentrifuged for two min. The 
supernatant was collected and stored at -20 °C until 
reamplif ication. The band extract was reamplified with 
the same cycling parameters in a 50 nl reaction 
consisting of 10 mM Tris-HCl pH 8.4, 50 mM KCl, 1.5 mM 
MgClz/ 20 fiM each dNTP, 0.2 /xM 5' arbitrary primer, 1 fxM 
3' anchor primer, 5 /xl of band extract and 2 . 5 U of Taq 
DNA polymerase (Gibco BRL) . 

Differential expression of the reamplified DNA fragment 
was scrutinized by reverse Northern and Northeom blot 
analyses. In reverse Northern analysis, after 

confirmation in a 1% agarose gel, the reamplified DNA 
fragment (10 /xl of PGR reaction) was mixed with 90 fil TE 
and spotted on a positively charged Nylon membrane 
(Boehringer Mannheim) with a 96 -well vacuum manifold. 
The membrane was soaked with denaturing and neutralizing 
solution successively, and the spotted DNA was 
crosslinked to the membrane with a UV crosslinker 
(Stratagene) . ^^P- labeled first strand cDNA was prepared 
by reverse transcription of total RNA. After heating at 
70 for 10 min and quenching on ice for two min, 0.4 ;xM 
each T 13 A, T 13 G and T 13 C and 10 /xg total RNA mixture 
was added with 50 mM Tris-HCl, pH 8.3, 75 mM KCl, .3 mM 
MgCl2, 10 mM DTT, 0,5 mM dATP, 0.5 mM dGTP, 0 . 5 mM dTTP, 
0.02 mM dCTP, 0.5 ^1 RNase inhibitor (Gibco BRL), 100 fiCi 
dCTP (3000 Ci/mmole from Amersham) and 200 U Superscript 
RT II (Gibco BRL) in a final 25 /xl reaction. The 
reaction mixture was incubated at 42 for one hr and at 
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37 ""C for 30 min after addition of 2 fxl of RNase H (lOU, 
Gibco BRL) . The membrane was hybridized at 42 °C 
overnight in a 50% formamide hybridization solution. The 
hybridized membrane was washed at room temperature for 15 
5 min with 2X SSC containing 0.1% SDS twice and at 55 "^C for 
at least one hr with 0 . IX SSC containing 0.1% SDS, 
successively. The membrane was probed with the 

^^P- labeled cDNA of Ell, stripped off and probed with 
^^P- labeled cDNA of Ell-NMT. The signal intensity of each 
10 spot was normalized against that of GAPDH and compared 
between Ell and Ell-NMT. Reamplified DNA fragments 
displaying differential expression levels ^1.8 -fold 
higher between the two cell types were selected and 
analyzed by Northern blotting analysis. 

15 

In Northern blot analysis, 10 /xg of total RNA from Ell 
and Ell-NMT cells were run side-^by-side in a 1% agarose 
gel with formaldehyde and transferred to a positively 
charged Nylon membrane. Reamplif ication reaction (5 /xl) 

20 was ^^P- labeled with a multiprime labeling kit (Boehringer 
Mannheim) used to probe the membrane as described above, 
DNA fragments expressed differentially between Ell and 
Ell-NMT in Northern blot analyses were cloned into the 
Eco RV site of the pZEro-2.1 cloning vector (Invitrogene) 

25 and sequenced. In order to confirm differential 
expression, the cloned cDNA fragment was released by Eco 
RI -Xho I, ^^P- labeled and used to probe Northern blots as 
described above. Samples of RNAs from various Ell and 
Ell-NMT derivatives displaying either a progressed or 

30 suppressed progression phenotype, based on nude mice 
tumorigenesis and soft agar cloning assays were analyzed. 
These included Ell, Ell-NMT, CREF X Ell-NMT Fl and F2 
somatic cell hybrids (suppressed progression phenotype) , 
CREF X Ell-NMT Rl and- R2 . somatic cell hybrids 

35 (progression phenotype) , Ell X Ell-NMT A6 somatic cell 
hybrid (suppressed progression phenotype) , Ell X Ell-NMT 
A6TD tumors derived somatic cell hybrid (progression . 
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phenotype) , Ell X Ell-NMT 3b somatic cell hybrid 
(suppressed progression phenotype) , Ell X Ell-NMT 2a 
(progression phenotype) , Ell-NMT AZA Bl and CI 
B-azacytidine treated Ell-NMT clones (suppressed 
progression phenotype) , Ell-ras R12 clone containing the 
Ha-ras oncogene (progression phenotype) and Ell-HPV E6/E7 
clone containing the human papilloma virus -18 E6 and E7 
gene region (progression phenotype) . Differential 
expression of the PEGen and PSGen genes 'in the various 
cell types was confirmed using ^^P- labeled probes and 
Northern hybridization analysis. After reconfirmation of 
differential expression, the plasmids containing the 
differentially expressed DNA fragments were sequenced by 
the dideoxy sequencing procedure. 

Results and Diecussion 

Subtraction hybridization provides a direct means of 
enriching for unique cDNA species and eliminating common 
sequences between complex genomes . DDRT-PCR is a proven 
methodology for the rapid identification and cloning of 
differentially expressed sequences between cell types 
(3,4,22). In principle, subtraction hybridization 
combined with DDRT-PCR should reduce band complexity 
which often obscures the identification of differentially 
expressed genes and generates false positive signals 
(23,28). This strategy, RSDD, has been used to analyze 
genes differentially expressed during transformation 
progression. The differential RNA display pattern of Ell 
and Ell-NMT cells using standard differential RNA display 
DDRT-PCR) and RSDD is shown in Fig. 1 (Left Panel) - As 
predicted, the differential RNA display pattern of RSDD 
was much less complex than that of DDRT-PCR. The 
majority of bands common to both cDNA samples were 
eliminated using RSDD. These experiments demonstrate 
that subtractive hybridization prior to differential RNA 
display is effective in simplifying display patterns 
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permits the efficient identification of differentially- 
expressed cDNAs. Since RSDD significantly reduced the 
number of bands displayed, single anchor oligo dT 
primers, that can increase band numbers, were 
5 successfully used in subsequent applications of the RSDD 
approach (Fig. 1; Right Panel) . Using RSDD, 235 
differentially displayed cDNAs in the Ell/Ell-NMT tumor 
progression model system were isolated. 

10 Hakvoort et . al . (32) used a reciprocal subtraction 
approach to analyze gene expression changes resulting 
during liver regeneration following 70% hepatectomy, 
i.e., normal liver subtracted from partially 
hepatectomized regenerating liver and vice versa. 

15 Although some bands displayed apparent enrichment, the 
complexity of the display pattern did not show 
appreciable simplification. These results are in stark 
contrast to RSDD, which results in a clear delineation 
and simplification of differentially expressed amplified 

20 bands (Figs. 1). Although conceptually similar, RSDD is 
significantly more effective than the subtraction plus 
DDRT-PCR approach described by Hakvoort et al. (32) . The 
improved efficiency of RSDD versus the Hakvoort et al. 
(32) approach can be attributed to several factors. The 

25 approach of Hakvoort et al. (32) is based on the 
subtraction procedure described by Wang and Brown (38) . 
This approach involves multiple rounds of 
PCR-amplification prior to each round of subtractive 
hybridization. In contrast, RSDD involves a single round 

30 of reciprocal subtraction that does not involve PGR 
amplification (5,10). In this respect, the complicated 
display pattern observed by Hakvoort et al. (32) even 
after three or four rounds of subtraction might result 
from reduced subtraction efficiency, PGR artifacts or a 

35 combination of these problems. Increasing the number of 
reactions by using two-base pair anchored oligo dT 
primers did not reduce the complexity of displayed bands 



(32) . In these contexts, a critical component for the 
successful use of RSDD involves the use of an appropriate 
subtraction hybridization protocol, that can efficiently 
reduce cDNA complexity and generate stable populations of 
cDNAs for analysis. 

Previous studies demonstrate that different gene cloning 
strategies, including DDRT-PCR, subtraction hybridization 
and electronic display, identify dissimilar 
differentially expressed genes (18) . These results 
suggest that a single approach for gene identification 
may not identify the complete spectrum of differentially 
expressed genes (18) . Similarly, RSDD and DDRT-PCR do 
not resolve the same differentially expressed bands (Fig. 
1) . Unique bands identified in DDRT-PCR that were 
differentially expressed when analyzed by Northern 
blotting were not the same as those found using RSDD and 
vise versa. These results are not surprising, since, as 
indicated above, subtraction hybridization and 
differential RNA display identified distinct 
differentially expressed genes. Apparently, specific 
differentially expressed genes are lost during 
subtraction hybridization and differential RNA display of 
subtracted cDNAs. On the basis of these considerations, 
it will be essential to use multiple gene discovery 
approaches to identify and clone the complete spectrum of 
differentially expressed genes. 

DDRT-PCR can generate large numbers of differentially 
displayed bands making subsequent analysis both labor 
intensive and a daunting challenge. In order to reduce 
these limitations of DDRT-PCR, RSDD has been used in 
combination with reverse Northern analyses of isolated 
cDNAs. Gel extracted cDNA fragments were reamplified, 
dot -blotted on Nylon membranes and successively probed 
with reverse transcribed ^^P-cDNA from Ell or Ell-NMT RNAs 
(Fig. 2) . Signals were detected in 181 reamplified bands 
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out of 235 (77%) . This number is lower than that 

observed using DDRT-PCR (51 out of 54) • However, this 

■ 

comparison may not be accurate since only four arbitrary 
primers were used for DDRT-PCR and fewer differentially 
5 expressed bands were detected and isolated. A possible 
reason for the high incidence of false positives in RSDD 
may be due to the existence of foreign plasmid-like DNA 
in the cDNAs and the inaccurate reading properties of 
DDRT-PCR. 
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Table 1. Differentially Expressed cDNA Fragments 
Cloned by DDRT-PCR. 



5 



Nomenclature 


Identity 


Homology 


PEGen 41 


To be determined 




PEGen 42 


Novel 


Novel 


PEGen 43 


Novel 


Novel 


PEGen 44 


Novel 


Novel 


PEGen 45 


Hoxall locus antisense 


mouse 90% 


PEGen 46 


Glutamyl t-RNA synthetase 


human 59% 






Novel 


PEGen 50 


Novel 


Novel 


PSGen 1 


Supervillin 


B. 

taurxis 80% 


PSGen ^ 
PSGen 4 


XI X J-i V ~ X X oiJ^ xi.1 w cx u xAjy 

protein 

Proteasome activator 


human 9 1 % 
Rat 100% 


PSGen 27 


Novel 





The signal intensities of the various cDNAs in reverse 
30 Northern analysis were quantified and noirmalized against 
that of GAPDH, which remained unchanged in Ell and 
Ell-NMT cells. The PEG-3 (PEGen-3) gene (10) was used as 
an additional control, to verify increased expression in 
Ell-NMT versus Ell cells. In the reverse Northern 
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analyses, PEGen-3 levels were 4-fold higher in Ell-NMT 
than in Ell cells, which coincided with Northern blotting 
results, thereby demonstrating the concordance of reverse 
Northern and Northern assays. A ^ 1.8 -fold differential 
cut-off (after normalization for GAPDH expression) was 
used to identify and isolate cDNA bands displaying 
modified expression in Ell versus Ell-NMT cells. This 
resulted in the identification of 7 cDNAs with higher 
expression in Ell versus Ell-NMT cells and 65 cDNAs with 
elevated expression in Ell-NMT versus Ell cells. These 
results suggest that tumor progression in Ell-NMT cells 
correlates with the increased expression of a large 
number of genes, whereas only a smaller subset of genes 
display decreased expression. 

A problem present in DDRT-PCR, that is reduced but still 
can occur in RSDD, is the isolation of multiple cDNA 
species from what appears to be a single amplified band. 
When this occurs, these multiple species can produce 
spurious results when analyzed by reverse Northern 
analyses. For example, if two distinct species are 
isolated, one displaying modified expression and a second 
not displaying modified expression, an accurate estimate 
of differential expression will not be obtained by 
reverse Northern analysis. In this case, a number of 
potential false positives generated using reverse 
Northern analyses, may in reality not be false positives, 
but instead may represent multiple cDNAs. This problem 
may be ameliorated by performing single strand 
confoxmational polymorphism (SSCP) or reverse Northern 
analyses using cloned cDNA populations (39,40) . 

The expression pattern of representative RSDD-derived 
cDNAs in Ell versus Ell-NMT and in a more expanded 
Ell/Ell-NMT progression cell culture series is shown in 
Figs. 3 and 4, respectively. Reverse Northern results 
correlated well with Northern blots using Ell and 
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Ell-NMT (^80% concordance) or a larger panel of cells 
differentially displaying the progression phenotype, 
including progression negative, Ell, CREF x Ell-NMT Fl, 
CREF X Ell-NMT F2, Ell X Ell-NMT A6, Ell X Ell-NMT 3b, 
5 Ell-NMT Aza Bl and Ell-NMT Aza CI, and progression 
positive Ell-NMT, CREF X Ell-NMT Rl, CREF X Ell-NMT R2, 
Ell X Ell-NMT A6TD, Ell X Ell-NMT Ila, Ell-ras and 
Ell-HPV E6/E7. Sequence analysis of the various 
progression upregulated genes (PEGen) and progression 
10 suppressed genes (PSGen) identified both known and 
unknown genes (Table 2) • Known PEGen genes included 
PEGen 7 (HPV16 ElBP) , PEGen 8 (PFK-C) , PEGen 21 (FIN 14) 
and PEGen 26 (poly ADP-ribose polymerase) and a known 
PSGen gene was PSGen 10 (ferritin heavy chain) . Two 
15 PEGen genes out of six were found to be novel (PEGen 14 
and PEGen 24) and two PSGen genes out of three were found 
to be novel (PSGen 12 and PSGen 13) (Table 2) • 



Table 2. Differentially Expressed cDNA Fragments 
Cloned by RSDD 



Nomenclature Identity 

PEGen 7 

PEGen 8 

PEGen 13 

PEGen 14 

PEGen 15 

PEGen 21 

PEGen 24 

PEGen 26 

PEGen 28 

PEGen 32 
PSGen 10 

PSGen 12 

PSGen 13 

PSGen 23 



Homology 



HPV16 ElBP 



Human 90% 



PFK-C 



Rat 100% 



Novel 



Novel 



Novel 



Novel 



Novel 



Novel 



FIN 14 



Mouse 94% 



Novel 



Novel 



Poly ADP-ribose Polymerase Rat 100% 



Novel 



Novel 



Ferritin Heavy Chain 



Novel 



Novel 



Novel 



Novel 

Novel 
Rat 100% 

Novel 

Novel 

Novel 



PSGen 24 



Novel 



Novel 



PSGen 25 



Novel 



Novel 



PSGen 26 



Novel 



Novel 



PSGen 27 



Novel 



Novel 



PSGen 28 



Novel 



Novel 



PSGen 29 



Novel 



Novel 



PEGen 7 is expressed at ^ 5-fold higher levels in Ell-NMT 
than in Ell cells. PEGen 7 is ^90% homologous to 
16E1-BP, a cDNA encoding a protein identified using the 
yeast two-hybrid assay that interacts with human 
papillomavirus type 16 El protein (41) • 16E1-BP encodes 
a 432aa protein of unknown function but does contain an 
ATPase signature motif (Gly-X4-Gly consensus ATP binding 
motif at aa 179 through 186) . 16E1-BP appears to be a 
form of TRIP13, a protein previously shown to bind 
thyroid hormone receptor in yeast two-hybrid assays. The 
role of PEGen 7/16E1-BP in the progression phenotype in 
the Ell/Ell-NMT progression model is not known. 
Additional studies are necessary to determine if this 
gene change is associative or causative of transformation 
progression.. 

PEGen 8 is expressed at "^3- to 4- fold higher levels in 
Ell-NMT than in Ell cells. PEGen 8 shows 100% homology 
to rat phosphofructokinase C (PFK-C) (42) . PFK catalyzes 
the rate- limiting and committed step in glycolysis, the 
conversion of fructose 6 -phosphate to fructose 
1,6-biphosphate. Three subunit isozymes of PFK have been 
identified, that form homo- and heterotetramers with 
differing catalytic and allosteric properties. PFK-M is 



specific for cardiac and skeletal muscle, PFK-L is 
expressed in many tissues but is most abundant in the 
liver and PFK-C is expressed in several brain regions and 
the anterior pituitary but not in liver, skeletal muscle, 
or several other human tissues. The cDNA of PFK-C 
isolated from a rat hypothalamic cDNA library is 2643 bp 
and encodes a protein of 765aa (42) . In a recent study, 
Sanchez -Martinez and Aragon (43) demonstrated that PFK-C 
is the predominant form of PFK in ascites tumor cells 
(obtained from a transplantable mouse carcinoma of 
mammary origin) , whereas PFK-L is most abundant in the 
normal mammary gland. These results suggest the 
interesting possibility that PFK-C might contribute to 
the malignant nature of specific target cells. The role 
of PEGen 8/PFK-C in progression in the Ell/Ell-NMT model 
remains to be determined. 

PEGen 21 is expressed at ^"3- to 4 -fold higher levels in 
Ell-NMT than in Ell cells, PEGen 21 displays ""94% 
homology with the fibroblast growth factor- 4 inducible 
gene FIN-14 (44) . FIN-14 is a novel cDNA of unknown 
function that hybridizes with a 4.5 kb mRNA that is 
induced 4 -fold in NIH3T3 mouse cells following treatment 
with FGF-4 . The induction of FIN-14 occurs late (18 hr) 
after treatment with FGF-4 and does not occur when cells 
are treated for 18 hr with FGF-4 in the presence of 
cycloheximide (44). These results confirm that FIN-14 
encodes a late-inducible gene. Moreover, nuclear run-on 
assays document that FIN-14 is trancriptionally activated 
in NIH3T3 cells following growth factor stimulation. 
Tissue distribution studies indicate expression of a 
single mRNA species in the kidney with low levels of 
expression observed in several other tissues including 
testis and thymus. Mouse embryogenesis studies indicate 
that FIN-14 expression occurs constitutively in mouse 
embryos between day 10.5 and 15.5. Unlike NIH3T3, FIN-14 
was constitutively expressed in PC12 cells and its level 
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did not vary appreciably in response to growth factor 
stimulation. The role of PEGen 21/FIN-14 in progression 
in Ell/Ell -NMT model system is not currently known. 

The PSGen cDNAs, PSGen-12 and PSGen-13, consist of 
sequences without homology to those presently reported in 
various DNA databases. Expression of these cDNAs is ~3- 
to 4 -fold higher in Ell versus Ell-NMT cells (Fig. 3) . 
It is not currently known whether these genes simply 
correlate with or functionally regulate the progression 
phenotype. The identification of full-length cDNAs for 
PSGen-12 and PSGen-13 are in progress and once identified 
experiments can be conducted to directly define the role 
of these PSGen 's in cancer progression. 

We presently demonstrate that a modified differential RNA 
display technique, RSDD, can efficiently identify 
differentially expressed cDNAs. As predicted, 

subtractive hybridization prior to differential RNA 
display greatly reduces band complexity, a problem 
encountered in standard DDRT-PC3i in which RNA samples are 
directly analyzed without subtraction. Unlike a previous 
report using subtracted cDNAs processed through 
successive rounds of PGR (32,45) , common bands were 
eliminated using reciprocally subtracted cDNA libraries 
that had not been processed using PGR. In addition to 
subtraction hybridization, the discovery of 
differentially e3q>ressed genes was further streamlined by 
using reverse Northern analyses with isolated cDNAs. 
With 3 single anchored oligo dT primers and 18 arbitrary 
5* primers, 72 bands were identified that displayed 
differential expression using reverse Northern analysis. 
Currently, 40 of these cDNA species have been analyzed by 
Northern blotting and found to display differential 
expression in Ell versus Ell-NMT cells. Subsequent 
studies with the majority of these RSDD cDNAs 
demonstrated coordinated expression with the progression 
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phenotype in a large panel of unprogressed and progressed 
transformed cells. Current sequence analysis of the 
cloned cDNA fragments revealed 9 different genes, 
including 4 novel genes not reported in recent DNA 
databases. RSDD represents a method of choice either as 
a more efficient and less time consuming modification of 
the differential RNA display strategy or as a screening 
methodology for identifying differentially expressed 
genes in reciprocally subtracted cDNA libraries. 
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gecond Seri^P of Eypeyj^eatp 

Presently described is a RSDD approach that efficiently 
and consistently reduces the complexity of DDRT-PCR and 
5 results in the identification and cloning of genes 
displaying anticipated differential expression. The model 
used for RSDD was an adenovirus -transformed rat embryo 
cell line, Ell, that acquires an aggressive oncogenic 
progression phenotype when injected into athymic nude 

10 mice and reestablished in cell culture 
(Ell-NMT) (6,26,27) . Injection of Ell cells into nude mice 
results in tumors in 100% of animals with a tumor latency 
time of approximately 35 to 40 days, whereas Ell-NMT 
cells form tumors in 100% of nude mice with a tumor 

15 latency time of 15 to 20 days (6,26,27). Additionally, 
Ell cells form colonies in agar with an efficiency of ^^3 
%, whereas Ell-NMT display an agar cloning efficiency of 
>30% (6,26,27) . The increased tumor igeni city and enhanced 
anchorage independence phenotypes are key indicators of 

20 tumor progression in the Ell/Ell -NMT model system 
(6,26,27). RSDD has resulted in the identification and 
cloning of genes displaying elevated expression in 
progressed tumor cells (progression elevated gene, PEGen) 
and suppressed expression in progressed tumor cells 

25 (progression suppressed gene, PSGen) . 

MATgRIA^g AWP MBTHOPg 

RNA isolation and cDNA library construction. Total RNA 
from Ell and Ell-NMT cells was isolated by the 

3 0 guanidiniura isothiocyanate/CsCl centrifugation procedure 
and poly (A)* RNA was purified with oligo(dT) cellulose 
chromatography (5) • Two X-ZAP cDNA libraries from Ell and 
Ell-NMT mRNAs were constructed with A-ZAP cDNA library 
kits (Stratagene) following the manufacturer's protocol. 

35 Reciprocal subtraction between Ell and Ell-NMT libraries 
was performed and two subtracted cDNA libraries (Eil 
minus Ell-NMT and Ell-NMT minus Ell) were constructed as 
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described{5,6) . Plasmid cDNA libraries from the 
subtracted X-ZAP cDNA libraries were obtained by in vivo 
excision following the manufacturer's protocol 
(Stratagene) and the plasmids were isolated with Qiagen 
columns (Qiagen, Chatsworth, CAJ. 

RSDD methodology. The purified plasmids of reciprocally 
subtracted cDNA libraries were directly subjected to 
differential display as in Liang et al . (28) with minor 
modifications. The plasmids of reciprocally subtracted 
cDNA libraries were PC3i-amplif ied with the combination of 
three single-anchor 3' primers (T13A, T13C or T13G) and 18 
arbitrary 5' 10-mer primers obtained from Operon 
Technology Inc. (Alameda, CA. OPA 1-20 except OPAl and 
3) . The 20 fxl PCR reaction consisted of 10 mM Tris-HCl 
(pH 8.4), 50 mM KCl, 1.5 mM MgCla, 2 ^xM each dNTP, 0.2 fiM 
5' arbitrary primer, 1 /xM 3 ' anchor primer, 50 ng of 
plasmid of a subtracted library, 10 fxCi a-^^S-dATP (3,000 
Ci/mmol from Amersham) and 1 unit of Taq DNA polymerase 
(Gibco/BRL) . The parameters of PCR were 30 sec at SS'^C, 40 
cycles of 30 sec at SS^'C, 2 min at 40°C and 30 sec at 72°C 
and additional 5 min. at 72'*C. After the cycling, 10 /xl of 
95% formamide, 0.05% bromophenol blue and 0.05% xylene 
cyanol were added to each PCR reaction. The mixture was 
heated at 95®C for 2 min and separated in a 5% denaturing 
DNA sequencing gel maintained at 50**C. PCR reactions of 
plasmids from each subtracted library in a primer set 
were run side by side. Differentially amplified bands 
from plasmids of each subtracted library were marked with 
18G needle through the film and cut out with a razor. The 
gel slice was put in 100 /xl TE (pH 8.0) and incubated at 
4''C overnight. After the incubation, the mixture was 
boiled for 5 min and microcentrifuged for two min. The 
supernatant was collected and stored at -20®C until 
reamplif ication. The band extract was reamplified with 
the same cycling parameters in a 50 /xl reaction 
consisting of 10 mM Tris-HCl (pH 8,4), 50 mM KCl, 1.5 mM 



MgClj, 20 ixM each dNTP, 0.2 /zM 5' arbitrary primer, 1 /iM 
3' anchor primer, 5/il of band extract and 2.5 units of 
Taq DNA polymerase (Gibco/BRL) . 

Reverse Northern Blotting Procedure. Differential 
expression of the reamplified DNA fragment was 
scrutinized by reverse Northern and Northern blot 
analyses. In reverse Northern analysis, after 
confirmation in a 1% agarose gel, the reamplified DNA 
fragment (10 /xl of PGR reaction) was mixed with 90 /il TE 
and spotted on a positively charged Nylon membrane 
(Boehringer Mannheim) with a 96-well vacuum manifold. The 
membrane was soaked with denaturing and neutralizing 
solution successively, and the spotted DNA was 
crosslinked to the membrane with a UV crosslinker 
(Stratagene) . ^^P- labeled first strand cDNA was prepared 
by reverse transcription of total RNA, After heating at 
70^C for 10 min and quenching on ice for two min, 0.4 /xM 
each T13A, T13G and TSa C and 10 fxg total RNA mixture was 
added with 50 mM Tris-HCl, (pH 8.3), 75 mM KCl, 3 mM 
MgCla, 10 mM DTT, 0.5 mM dATP, 0.5 mM dGTP, 0.5 mM dTTP, 
0.02 mM dCTP, 0.5 ^1 RNase inhibitor (Gibco/BRL), 100 /xCi 
dCTP (3,000 Ci/mmol from Amersham) and 200 units 
Superscript RT II (Gibco/BRL) in a final 25 fil reaction. 
The reaction mixture was incubated at 42°C for one hoiir 
and at 37''C for 30 min after addition of 2 fil of RNase H 
(10 units, Gibco/BRL) . The membrane was hybridized at 42''C 
overnight in a 50% formamide hybridization solution. The 
hybridized membrane was washed at room temperature for 15 
min with 2X standard saline citrate containing 0.1% SDS 
twice and at 55**C for at least one hour with O.IX Standard 
Saline Citrate containing 0.1% SDS, successively. The 
membrane was probed with the ^^P-labeled cDNA of Ell, 
striped off and probed with ^^P-labeled cDNA of Ell-NMT. 
The signal intensity of each spot was normalized against 
that of glyceraldehyde- 3 -phosphate dehydrogenase and 
compared between Ell and Ell-NMT. Reamplified DNA 
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fragments displaying differential expression levels 
^1. 8-fold higher between the two cell types were selected 
and analyzed by Northern bloting analysis. 

5 Northern Blotting Analysis. In Northern blot analysis, 10 
fxg of total RNA from Ell and Ell-NMT cells were run 
side -by- side in a 1% agarose gel with formaldehyde and 
transferred to a positively charged Nylon membrane. 
Reamplif ication reaction (5 /il) was ^^P-labeled with a 
10 multiprime labeling kit (Boehringer Mannheim) used to 
probe the membrane as described above. DNA fragments 
expressed differentially between Ell and Ell-NMT in 
Northern blot analyses were cloned into the EcoRV site of 
the pZEro-2,1 cloning vector (Invitrogene) and sequenced. 

15 

To confirm differential expression, the cloned cDNA 
fragment was released by EcoRI-XhoI, ^^P-labeled and used 
to probe Northern blots as described above. Samples of 
RNAs from various Ell and Ell-NMT derivatives displaying 
20 either a progressed or suppressed progression phenotype, 
based on nude mice tumorigenesis and soft agar cloning 
assays were analyzed. These included Ell, Ell-NMT, C31EF 
X Ell-NMT Fl and F2 somatic cell hybrids (suppressed 
progression phenotype) , CREF x Ell-NMT Rl and R2 somatic 
25 cell hybrids (progression phenotype) , Ell x Ell-NMT A6 
somatic cell hybrid (suppressed progression phenotype) , 
Ell X Ell-NMT A6TD tumor-derived somatic cell hybrid 
(progression phenotype) , Ell x Ell-NMT 3b somatic cell 
hybrid (suppressed progression phenotype) , Ell x Ell-NMT 
30 Ila (progression phenotype) , Ell-NMT AZA Bl and CI 
5-azacytidine treated Ell-NMT clones (suppressed 
progression phenotype) , Ell-Ras R12 clone containing the 
Ha-ras oncogene (progression phenotype) and Ell-HPV E6/E7 
clone containing the human papilloma virus -18 E6 and E7 
35 gene region (progression phenotype) . Differential 
expression of the PEGen and PSGen genes in the various 
cell types was confirmed using ^^P- labeled probes and 
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northern hybridization analysis. After reconfirmation of 
differential expression, the plasmids containing the 
differentially expressed DNA fragments were sequenced by 
the dideoxy sequencing procedure. 

5 

RESULTS jOro DISCUSSION 

Subtraction hybridization providies a direct means of 
enriching for unique cDNA species and eliminating common 
sequences between complex genomes (7 , 18) . DDRT-PCR is a 

10 proven methodology for the rapid identification and 
cloning of differentially expressed sequences between 
cell types (1,2,28). In principle, subtraction 
hybridization combined with DDRT-PCR should reduce band 
complexity which often obscures the identification of 

15 differentially expressed genes and generates false 
positive signals (21,29). RSDD has been used to analyze 
genes differentially expressed during transformation 
progression (Fig, 28) . Differential RNA display was 
directly performed with reciprocally subtracted cDNA 

20 plasmid libraries (Ell minus Ell-NMT and Ell-NMT minus 
Ell) that had not been subjected to PGR, Three single 
anchored oligo dT 3 ' primers were used for subsequent 
amplification prior to display- To further streamline the 
DDRT-PCR procedure, reamplif ied cDNAs identified using 

25 RSDD were analyzed using the reverse Northern blotting 
procedure (30,31). cDNAs displaying differential 
expression by reverse Northern blotting were subsequently 
confirmed for true differential expression by Northern 
analysis . 

30 

The differential RNA display pattern of Ell and Ell-NMT 
cells using standard differential RNA display (DDRT-PCR) 
and RSDD is shown in Fig. 1 (Left Panel) . The 
differential RNA display pattern of RSDD is much less 
35 complex than that of DDRT-PCR. These experiments 
demonstrate that subtractive hybridization prior to 
differential RNA. display is effective in simplifying 
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display patterns permitting the efficient identification 
of differentially expressed cDNAs . Since RSDD 
significantly reduced the number of bands displayed, 
single anchor oligo dT primers, that can increase band 
5 numbers, were successfully used in subsequent 
applications of the RSDD approach (Fig. 1; Right Panel). 
Using RSDD, 234 differentially displayed cDNAs in the 
Ell/Ell-NMT tumor progression model system were isolated. 
Hakvoort et al.(25) used a reciprocal subtraction 
10 approach to analyze gene expression changes resulting 
during liver regeneration following 70% hepatectomy, 
i.e., normal liver subtracted from partially 
hepatectomized regenerating liver and vice versa. 
Although some bands displayed apparent enrichment, the 
15 complexity of the display pattern did not show 
appreciable simplification. In contrast, RSDD results in 
a clearer delineation and simplification of 
differentially expressed amplified bands (Figs. 1) . 
Although conceptually similar, RSDD is significantly more 
20 effective than the subtraction plus DDRT-PCR approach 
described by Hakvoort et al. (25) The reasons for the 
improved efficiency of RSDD versus the Hakvoort et al. 
(25) approach are not known. One possibility is that the 
differences between the experimental approaches may 
25 reflect the subtraction hybridization strategies 
employed. The approach of Hakvoort et al. (25) is based 
on the subtraction procedure described by Wang and Brown 
(32) . This approach uses multiple rounds of 
PGR- amplification prior to each round of subtract ive 
30 hybridization. In contrast, RSDD involves a single round 
of reciprocal subtraction without intermediate 
amplification (5, 6) . In this respect, the complicated 
display pattern observed by Hakvoort et al. (25) even 
after three or four rounds of subtraction might result 
35 from reduced subtraction efficiency, PGR artifacts or a 
combination of these problems. Increasing the number of 
reactions by using two-base pair anchored oligo dT 



-53- 

primers did not reduce the complexity of displayed bands 
(25) . In these contexts, a critical component for the 
successful use of RSDD involves the use of an appropriate 
subtraction hybridization protocol, which can efficiently 
5 reduce cDNA complexity and generate stable populations of 
cDNAs for analysis. 

Previous studies demonstrate that different gene cloning 
strategies, including DDRT-PCR, subtraction hybridization 

10 and electronic display, identify distinct subsets of 
differentially expressed genes (18). These results 
suggest that a single approach for gene identification 
may not identify the complete spectrum of differentially 
expressed genes. Similarly, RSDD and DDRT-PCR do not 

15 resolve the same differentially expressed bands (Fig. l) . 
Unique bands identified in DDRT-PCR that were 
differentially expressed when analyzed by Northern 
blotting were not the same as those found using RSDD and 
vise versa {data not shown) . These results are not 

20 surprising, since, as indicated above, subtraction 
hybridization and differential RNA display identified 
distinct differentially expressed genes (18) . Apparently, 
specific differentially expressed genes are lost during 
subtraction hybridization and differential RNA display of 

25 subtracted cDNAs. On the basis of these considerations, 
it will be essential to use multiple gene discovery 
approaches to identify and clone the complete spectrum of 
differentially expressed genes. 

30 DDRT-PCR can generate large numbers of differentially 
displayed bands making subsequent analysis both labor 
intensive and a daunting challenge. In order to reduce 
these limitations of DDRT-PCR, RSDD has been iised in 
combination with reverse Northern analyses of isolated 

35 cDNAs. Gel extracted cDNA fragments were reamplified, 
dot -blotted on Nylon membranes and successively probed 
with reverse transcribed ^^P-cDNA from Ell or Ell-NMT RNAs 
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(Fig. 2) . Signals were detected in 181 reamplified bands 
out of 234 (77%) . 

The signal intensities of the various cDNAs in reverse 
Northern analysis were quantified and normalized against 
. that of GAPDH, which remained unchanged in Ell and 
Ell-NMT cells. Progression elevated gene-3 (PEG-3) (6) was 
used as an additional control, to verify increased 
expression in Ell-NMT versus Ell cells. In the reverse 
Northern analyses, PEG-3 levels were 4 -fold higher in 
Ell-NMT than in Ell cells, which coincided with Northern 
blotting results, thereby demonstrating the concordance 
of reverse Northern and Northern assays. A > 1.8- fold 
differential cut-off (after normalization for GAPDH 
expression) was used to identify and isolate cDNA bands 
displaying modified expression in Ell versus Ell-NMT 
cells. This resulted in the identification of 7 cDNAs 
with higher expression in Ell versus Ell-NMT cells and 65 
cDNAs with elevated expression in Ell-NMT versus Ell 
cells. These results suggest that tumor progression in 
Ell-NMT cells correlates with increased expression of a 
large number of genes, whereas only a smaller subset of 
genes display decreased expression. 

A problem frequently encountered in DDRT-PCR, that is 
reduced but still can occur in RSDD, is the isolation of 
multiple cDNA species from what appears to be a single 
amplified band. When this occurs, these multiple species 
can produce spurious results when analyzed by reverse 
Northern analyses. For example, if two distinct species 
are isolated, one displaying modified expression and a 
second not displaying modified expression, an accurate 
estimate of differential expression will not be obtained 
by reverse Northern analysis. In this case, a number of 
potential false positives generated using reverse 
Northern analyses, may in reality not be false positives, 
but instead may represent multiple cDNAs. By performing 



single strand conformational polymorphism (SSCP) or 
reverse Northern analyses using cloned cDNA populations 
(33,34) this problem can be ameliorated. 

The expression pattern of representative RSDD-derived 
cDNAs in Ell versus Ell-NMT and in a more expanded 
Ell/Ell-NMT progression cell culture series is shown in 
Figs. 29 and 30, respectively. Reverse Northern results 
correlated well with Northern blots using Ell and Ell-NMT 
("^75% concordance) or a larger panel of cells 
differentially displaying the progression phenotype, 
including progression negative Ell, CREF x Ell-NMT Fl and 
F2, Ell X Ell-NMT A6, Ell x Ell-NMT 3b, Ell-NMT Aza Bl 
and Aza CI cells, and progression positive Ell-NMT, CREF 
X Ell-NMT Rl and R2, Ell x Ell-NMT A6TD, Ell x Ell-NMT 
I la, Ell-Ras R12 and Ell-HPV E6/E7 cells. Sequence 
analysis of the various PEGen cDNAs identified both 
unknown and known genes (Table 3) . Five of 10 PEGen cDNAs 
(50%) were classified as novel sequences since no matches 
were found in current DNA databases. Novel PEGen cDNAs 
include, PEGen 13, 14, 24, 28 and 32. Known PEGen genes 
included PEGen 7 (human papilloma virus-16 early region 
1 binding protein; HPV16 ElBP) , PEGen 8 
(phosphof ructokinase kinase C; PFK-C) , PEGen 21 (a 
fibroblast growth factor-4 inducible gene; FIN 14) , PEGen 
26 (poly ADP-ribose polymerase) and PEGen 30 (rat espl 
homology) In the case of the PSGen cDNAs, six of six 
(100%) were novel, including PSGen 12, 13, 26, 27, 28 and 
29 (Table 3) . 
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Table 3« PEGen and PSGen genes Isolated using RSDD 



I Nomenclature^ 

(%)' 



Identity*" 



Homo logy 



PEGen 7 
PEGen 8 

PEGen 13 
PEGen 14 
PEGen 21 
PEGen 24 
PEGen 26 

PEGen 28 
PEGen 30 
PEGen 32 
PSGen 12 
PSGen 13 
PSGen 26 
PSGen 27 
PSGen 28 



Human HPV16 ElBP 
Rat phospho- 

fructokinase C (PFK-C) 
Unknown 

Unknown 

Murine FIN 14 

Unknovm 

Rat poly ADP-ribose 



polymerase 
Unknown 

Rat espl 

Novel 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 

Unknown 



100 



90 

100 
Novel 

Novel 

94 

Novel 



Novel 
98 

Novel 
Novel 
Novel 
Novel 
Novel 
Novel 
Novel 



^PEGen are progression elevated genes that display 
elevated expression in Ell-NMT versus Ell cells. PSGen 
are progression suppressed genes that display elevated 
expression in Ell versus Ell-NMT cells. 

^Sequences have compared with reported genes in various 
DNA data bases (including GenBank and EMBL) and 
identification with known genes are indicated. Genes 
without homology to currently reported genes are 
indicated as unknown, 

^percentage homology with known sequences, either human, 
rat or mouse is indicated. 

Where no homology exsists the cDNA is considered novel . 
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PEGen 7 is expressed at ~ 4 -fold higher levels in Ell-NMT 
than in Ell cells. PEGen 7 is "98% homologous to 16E1-BP, 
a cDNA encoding a protein identified using the yeast 
5 two-hybrid assay that interacts with human papillomavirus 
type 16 El protein (35) . 16E1-BP encodes a 432aa protein 
of unknown function but does contain an ATPase signature 
motif (Gly-X4-Gly consensus ATP binding motif at aa 179 
through 186). 16E1-BP appears to be a form of TRIP13, a 

10 protein previously shown to bind thyroid hormone receptor 
in yeast two-hybrid assays. The role of PEGen 7/16E1-BP 
in the progression phenotype in the Ell/Ell -NMT 
progression model is not known. Additional studies are 
necessary to determine if this gene change is associative 

15 or causative of transformation progression. 

PEGen 8 is expressed at ~3- to 4- fold higher levels in 
Ell-NMT than in Ell cells. PEGen 8 shows 100% homology to 
rat phosphofructokinase C (PFK-C) (36) . PFK catalyzes the 

20 rate-limiting and committed step in glycolysis, the 
conversion of fructose 6 -phosphate to fructose 
1,6-biphosphate- Three subunit isozymes of PFK have been 
identified, that form homo- and heterotetramers with 
differing catalytic and allosteric properties. PFK-M is 

25 specific for cardiac and skeletal muscle, PFK-L is 
expressed in many, tissues but is most abundant in the 
liver and PFK-C is expressed in several brain regions and 
the anterior pituitary but not in liver, skeletal muscle, 
or several other human tissues. The cDNA of PFK-C 

30 isolated from a rat hypothalamic cDNA library is 2643 bp 
and encodes a protein of 765aa (-36). In a recent study 
Sanchez -Martinez and Aragon (37) , demonstrated that PFK-C 
is the predominant form of PFK in ascites tumor cells 
(obtained from a transplantable mouse carcinoma of 

35 mammary origin) , whereas PFK-L is most abundant in the 
normal mammary gland. These results suggest the 
interesting possibility that PFK-C might contribute to 
the malignant nature of specific target cells. The role 
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presently reported of PEGen 8/PFK-C in progression in the 
Ell/Ell -NMT model remains to be determined. 

PEGen 21 is expressed at '^3- to 4-fold higher levels in 
5 Ell-NMT than in Ell cells. PEGen 21 displays "'98% 
.. homology with the fibroblast growth factor- 4 inducible 
gene FIN- 14 (38) , FIN- 14 is a novel cDNA of unknown 
function that hybridizes with a 4.5 kb mRNA that is 
induced 4-fold in NIH 3T3 mouse cells following treatment 

10 with FGF-4. The induction of FIN-14 occurs late (18 hr) 
after treatment with FGF-4 and does not occur when cells 
are treated for 18 hr with FGF-4 in the presence of 
cycloheximide (38) . These results confirm that FIN-14 
encodes a late-inducible gene. Moreover, nuclear run-on 

15 assays document that FIN-14 is transcriptionally 
activated in NIH 3T3 cells following growth factor 
stimulation. Tissue distribution studies indicate 
expression of a single mRNA species in the kidney with 
low levels of expression observed in several other 

20 tissues including testis and thymus. Mouse erabryogenesis 
studies indicate that FIN-14 expression occurs 
constitutively in mouse embryos between day 10,5 and 
15.5. Unlike NIH 3T3, FIN-14 was constitutively expressed 
in PC12 cells and its level did not vary appreciably in 

25 response to growth factor stimulation. The role of PEGen 
2l/FIN-i4 in progression in Ell/Ell-NMT model system is 
not currently known, 

PEGen 26 is expressed at '"S- to 4 -fold higher levels in 
30 Ell-NMT than in Ell cells. This cDNA is identical to rat 
poly(ADP-ribose) polymerase (PARP) (39) . PARP contributes 
to the ability of eukaryotic cells to contend with both 
environmental and endogenous genotoxic agents (40) . PARP 
is a nuclear enzyme that binds to DNA breaks and then 
35 catalyzes the covalent modification of acceptor proteins 
with poly (ADP-ribose) (39,40). PARP activity contributes 
to the recovery of proliferating cells from DNA damage 
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and to the maintenance of genomic stability, which may be 
regulated by effects on chromatin structure, DNA 
base-excision repair and cell cycle regulation (39,40). 
The role of PEGen 26/PARP in mediating the progression 
5 phenotype is not currently known. However, since cancer 
is a progressive disease characterized by the 
accumulation of genetic alterations in the evolving tumor 
(6), it is tempting to speculate that overexpression of 
PEGen 26/PARP in Ell-NMT may facilitate the ability of 

10 these aggressive cancer cells to maintain genomic 
stability during cancer progression. In this context, 
PEGen 26/PARP may be an integral component of 
progression. This hypothesis is readily testable. PEGen 
30 is expressed at 2- to 3-fold higher levels in Ell-NMT 

15 than in Ell cells. This cDNA displays "'98.5% homology to 
rat espl (41) . Rat espl encodes a 24-kDa nuclear protein 
which is the rat homologue of Drosophila Enhancer of 
split . , a gene involved in ventral ectodermal development 
in Drosophila (41) . PEGen 30 appears to be a homologue of 

20 espl, since the message detected in Ell and Ell-NMT cells 
("'4 kb) is larger in size than the reported espl 
transcript (1.3 kb) (41) . The role of PEGen 30/espl in 
tumor progression in Ell/Ell-NMT model system remains to 
be determined. 

25 

The PSGen cDNAs;, 12, 13, 26, 27, 28 and 29, consist of 
sequences without homology to those in various DNA data 
bases. Expression of PSGen 12 and PSGen 13 cDNAs is "^3- 
to 4 -fold higher in Ell versus Ell-NMT cells (Fig. 29) . 

30 It is not currently known whether these genes simply 
correlate with or functionally regulate the progression 
phenotype. The identification of full-length cDNAs for 
PSGen-12 and PSGen-13, as well as the other novel PSGen 
and PEGen cDNAs, are in progress and once isolated 

35 experiments can be conducted to directly define the role 
of these progression-related genes in cancer progression. 
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Presently demonstrated is a modified gene- identification 
and gene-cloning technique, RSDD, that can efficiently 
identify differentially expressed cDNAs. As predicted, 
subtract ive hybridization prior to differential RNA 
display greatly reduces band complexity, a problem 
encountered in standard DDRT-PCR in which RNA samples are 
directly analyzed without subtraction. Unlike a previous 
report using subtracted cDNAs processed th?:ough 
successive rounds of PGR (25,42), common bands were 
eliminated using reciprocally subtracted cDNA libraries 
that had not been processed using PGR. In addition to 
subtraction hybridization, the discovery ... of 
differentially expressed genes was further streamlined by 
using reverse Northern analyses with isolated cDNAs. With 
3 single anchored oligo dT primers and 18 arbitrary 5 ' 
primers, 72 bands were identified that displayed 
differential expression using reverse Northern analysis. 
Currently, 38 cDNA species have been analyzed by Northern 
blotting and 31 ('"82%) displayed differential expression 
in Ell versus Ell-NMT cells. Sequence analysis of the 
cloned cDNA fragments revealed 16 different genes, 
including 11 novel genes not reported in recent DNA 
databases. RSDD represents a method of choice either as 
a more efficient and less time consuming modification of 
the differential RNA display strategy or as a screening 
methodology for identifying differentially expressed 
genes in reciprocally subtracted cDNA libraries. 
Moreover, the ability of RSDD to identify differentially 
expressed genes that are dissimilar to those recognized 
using standard DDRT-PGR or subtraction hybridization 
indicates that this approach will be a valuable adjunct 
in cloning the complete repertoire of differentially 
expressed gene changes occurring between complex genomes , 
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